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Abstract 

We present a deterministic (1 + o(l))-approximation 0(n^/^+°e) + il)^+°e))_time 
algorithm for solving the single-source shortest paths problem on distributed weighted 
networks (the CONGEST model); here n is the number of nodes in the network and 
D is its (hop) diameter. This is the first non-trivial deterministic algorithm for this 
problem. It also improves (i) the running time of the randomized (l-|-o(l))-approximation 
0 (n^/^U^U _|_ T))-time^ algorithm of Nanongkai [STOC 2014] by a factor of as large 
as and (ii) the 0(e“^ log e“^)-approximation factor of Lenzen and Patt-Shamir’s 
-f Z?)-time algorithm [STOC 2013] within the same running time. Our running 
time matches the known time lower bound of e(n^/^/logn-f U) [Das Sarma et al. STOC 
2011] modulo some lower-order terms, thus essentially settling the status of this problem 
which was raised at least a decade ago [Elkin SIGACT News 2004]. It also implies a (2 -f 
o(l))-approximation 0{n ^algorithm for approximating a network’s 
weighted diameter which almost matches the lower bound by Holzer et al. [PODC 2012]. 

In achieving this result, we develop two techniques which might be of independent 
interest and useful in other settings: (i) a deterministic process that replaces the “hitting 
set argument” commonly used for shortest paths computation in various settings, and 
(ii) a simple, deterministic, construction of an o{l))-hop set of size 0(n^+°*^^^). 

We combine these techniques with many distributed algorithmic techniques, some of 
which from problems that are not directly related to shortest paths, e.g. ruling sets 
[Goldberg et al. STOC 1987], source detection [Lenzen, Peleg PODC 2013], and partial 
distance estimation [Lenzen, Patt-Shamir PODC 2015]. Our hop set construction also 
leads to single-source shortest paths algorithms in two other settings: (i) a (1 -I- o(l))- 
approximation 0(n°^^^)-time algorithm on congested cliques, and (ii) a (1 -|- o(l))- 
approximation 0(n°(^Uog IT)-pass 0(n^+°(^Uog W)-space streamm^ algorithm, when 
edge weights are in {1, 2,..., IT}. The first result answers an open problem in [Nanongkai, 
STOC 2014]. The second result partially answers an open problem raised by McGregor 
in 2006 [sublinear. inf o. Problem 14]. 


*A preliminary version of this paper appears in Symposium on Theory of Computing (STOC) 2016. 
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1 Introduction 


In the area of distributed graph algorithms we study the complexity for a network to compute 
its own topological properties, such as minimum spanning tree, maximum matching, or 
distances between nodes. A fundamental question in this area that has been studied for many 
years is how much time complexity is needed to solve a problem in the so-called CONGEST 
model (e.g. [GKP98, PROO, ElkOG, DHK+12, LPS13]). In this model (see Section 2 for 
details), a network is modeled by a weighted undirected graph G, where each node represents 
a processor that initially only knows its adjacent edges and their weight, and nodes must 
communicate with each other to discover global topological properties of the network. The 
communication between nodes is carried out in rounds, where in each round each node can 
send a small message to each neighbor. The time complexity is measured as the number 
of rounds needed to finish the task. It is usually measured by n, the number of nodes in 
the network, and D, the diameter of the communication network (when edge weights are 
omitted). Typically, D n. 

In this paper, we consider the problem of approximating single-source shortest paths 
(SSSP). In this problem, a node s is marked as the source node, and the goal is for every 
node to know how far it is from s. The unweighted version - the breadth-first search 
tree computation ~ is one of the most basic tools in distributed computing, and is well 
known to require Q{D) time (e.g. [PelOO]). In contrast, the only available solution for 
the weighted case is the distributed version of the Bellman-Ford algorithm [Bel58, For56], 
which requires 0{n) time to compute an exact solution. In 2004, Elkin [Elk04] raised 
the question whether distributed approximation algorithms can help improving this time 
complexity and showed that any a-approximation algorithm requires ll((n/a)^/^/logn-|- D) 
time [ElkOG]. Das Sarma et al. [DHK+12] (building on [PROO, KKP13]) later strengthened this 
lower bound by showing that any poly(n)-approximation (randomized) algorithm requires 
D(n^/^/logn -|- D) time. This lower bound was later shown to hold even for quantum 
algorithms [EKN+14]. 

Since running times of the form^ 0(n^/^ -|- D) show up in many distributed algorithms 
(e.g. MST [KP98, PROO], connectivity [Thu97, PTll], and minimum cut [NS14, GK13]) 
it is natural to ask whether the lower bound of [DHK“'“12] can be matched. The first 
answer to this question is a randomized 0{e~^ log e“^)-approximation -|- Zl)-time 

algorithm by Lenzen and Patt-Shamir [LPS13]^. The running time of this algorithm is nearly 
tight if we are satisfied with a large approximation ratio. For a small approximation ratio, 
Nanongkai [Nanl4] presented a randomized (1 -|- o(l))-approximation -\- T))-time 

algorithm. The running time of this algorithm is nearly tight when D is small, but can be 
close to 0(n^/^) even when D = o(n^/^). This created a rather unsatisfying situation: First, 
one has to sacrihce a large approximation factor in order to achieve the near-optimal running 
time, and to achieve a (1 -|- o(l)) approximation factor, one must pay an additional running 
time of which could be as far from the lower bound as when D is large. Because 
of this, the question whether we can close the gap between upper and lower bounds for the 
running time of (1 -|- o(l))-approximation algorithms was left as the main open problem in 
[Nanl4, Problem 7.1]. Secondly, and more importantly, both these algorithms are randomized. 

^Throughout, we use O to hide polylogarithmic factors. 

®Note that the result of Lenzen and Patt-Shamir in fact solves a more general problem. 
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Given that designing deterministic algorithms is an important issne in distributed computing. 
This leaves an important open problem whether there is a deterministic algorithm that is 
faster than Bellman-Ford’s algorithm, i.e. that runs in sublinear-time. 

Our Results. In this paper, we resolve the two issues above. We present a deterministic 
(1 -|- o(l))-approximation -|- Z?^'’'°(^))-time algorithm for this problem (the o(l) 

term in the approximation ratio hides a 1/polylogn factor and the o(l) term in the running 
time hides an O (\/loglogn/logn) factor). Our algorithm almost settles the status of this 
problem as its running time matches the lower bound of Das Sarma et al. up to an 
factor. 

Since an a-approximate solution to SSSP gives a 2Q;-approximate value of the network’s 
weighted diameter (cf. Section 2), our algorithm can (2 -|- o(l))-approximate the weighted 
diameter within the same running time. Previously, Holzer et al. [HW12] showed that for 
any e > 0, a (2 — e)-approximation algorithm for this problem requires D(n) time. Thus, the 
approximation ratio provided by our algorithm cannot be significantly improved without 
increasing the running time. The running time of our algorithm also cannot be significantly 
improved because of the lower bound of / \ogn + D) [DHK+12] for approximate SSSP 

which holds for any poly(n)-approximation algorithm. 

Using the same techniques, we also obtain a deterministic (1 -|- o(l))-approximation 
( 9 ( 7 ^o(i))_time algorithm for the special case of congested clique, where the underlying 
network is fully-connected. This gives a positive answer to Problem 7.5 in [Nanl4]. Previous 
algorithms solved this problem exactly in time [Nanl4] and 0{n^^^) [CKK+15], 

respectively, and (l-|-o(l))-approximately in time [CKK+15]'^. We can also compute 

a (2 -|- o(l))-approximation of the weighted diameter within the same running time. Holzer 
and Pinsker proved that computing a (2 — o(l))-approximation of the diameter requires H(n) 
time in the worst case [HP15] in the congested clique. 

Our techniques also lead to a (non-distributed) streaming algorithm for (1 -|- o(l))- 
approximate SSSP where the edges are presented in an arbitrary-order stream, and an 
algorithm with limited space (preferably 0(nlog W), when edge weights are in {1, 2,.. .W}) 
reads the stream in passes to determine the answer (see, e.g., [McG14] for a recent survey). It 
was known that O(nloglT) space and one pass are enough to compute an 0(logn/loglogn)- 
spanner and therefore approximate all distances up to a factor of 0(log n/ log log n) [FKM+08] 
(see also [FKM+05, Bas08, EZ06, Elkll]). This almost matches a lower bound which holds 
even for the s-t-shortest path problem (stSP), where we just want to compute the distance 
between two specific nodes s and t [FKM+08]. On unweighted graphs one can compute 
(l-|-e,/3)-spanners in /3 passes and space [EZ06], and get (l-l-e)-approximate SSSP 

in a total of 0(13/e) passes. In 2006, McGregor raised the question whether we can solve stSP 
better with a larger number of passes (see [Sub]). Very recently Guruswami and Onak [G016] 
showed that a p-pass algorithm on unweighted graphs requires /0{p)) space. 

This does not rule out, for example, an 0(logn)-pass 0(n)-space algorithm. Our algorithm, 
which solves the more general SSSP problem, gets close to this: it takes 0(n°(^) log W) passes 
and log W) space. 

^With this running time, [CKK+15] can in fact solve the all-pairs shortest paths problem. Also see [LG16] 
for further developments in the direction of [CKK+15]. 
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Overview of Techniques. Our crucial new technique is a deterministic process that can 
replace the following “path hitting” argument: For any c, if we pick 0(c) nodes uniformly 
at random as centers (typically c = then a shortest path containing n/c edges will 

contain a center with high probability. This allows us to create shortcuts between centers - 
where we replace each path of length n/c between centers by an edge of the same length - 
and focus on computing shortest paths between centers. This argument has been repetitively 
used to solve shortest paths problems in various settings (e.g. [UY91, HK95, DI06, BHS07, 
RZll, San05, DFI05, DFR09, M^dlO, Berl6, LPS13, Nanl4]). In the sequential model a set 
of centers of size 0(c) can be found deterministically with the greedy hitting set heuristic 
once the shortest paths containing n/c edges are known [Zwi02, Kin99]. We are not aware 
of any non-trivial deterministic process that can achieve the same effect in the distributed 
setting. The main challenge is that the greedy process is heavily sequential as the selection 
of the next node depends on all previous nodes, and is thus hard to implement efficiently in 
the distributed setting®. 

In this paper, we develop a new deterministic process to pick 0(c) centers. The key new 
idea is to carefully divide nodes into 0(1) types. Roughly speaking, we associate each type 
t with a value wt and make sure that the following properties hold: (i) every path tt with 
Vtin/c) edges and weight Q{wt) contains a node of type t, and (ii) there is a set of 0{n/c) 
centers of type t such that every node of type t has at least one center at distance o{wt). 
We define the set of centers to be the collection of centers of all types. The two properties 
together guarantee that every long path will be almost hit by a center: for every path vr 
containing at least nfc edges, there is a center whose distance to some node in vr is o{w{'k)) 
where, t(;(7r) is the total weight of vr. This is already sufficient for us to focus on computing 
shortest paths only between centers as we would have done after picking centers using the 
path hitting argument. To the best of our knowledge, such a deterministically constructed 
set of centers that almost hits every long path was not known to exist before. The process 
itself is not constrained to the distributed setting and thus might be useful for derandomizing 
other algorithms that use the path hitting argument. 

To implement the above process in the distributed setting, we use the source detection 
algorithm of Lenzen and Peleg [LP13] to compute the type of each node. We then use the 
classic ruling set algorithm of Goldberg et al. [GPS88] to compute the set of centers of each 
type that satisfies the second property above. (A technical note: we also need to compute a 
bounded-depth shortest-path tree from every center. In [Nanl4], this was done using the 
random delay technique. We also derandomize this step by adapting the partial distance 
estimation algorithm of Lenzen and Patt-Shamir [LP15].) 

Another tool, which is the key to the improved running time, is a new hop set construction. 
An {h, e)-hop set of a graph G = (R, E) is a set F of weighted edges such that the distance 
between any pair of nodes in G can be (l-l-e)-approximated by their /i-hop distance (given by 
a path containing at most h edges) on G' = (R, EldF) (see Section 2 for details). The notion 
of hop set was defined by Cohen [CohOO] in the context of parallel computing, although it 
has been used implicitly earlier, e.g. [UY91, KS92] (see [CohOO] for a detailed discussion). 
The previous SSSP algorithm [Nanl4] was able to construct an (n/A:,0)-hop set of size kn, 

®We note that the algorithm of King [Kin99] for constructing a blocker can be viewed as an efficient way 
to greedily pick a hitting set by efficiently computing the scores of nodes. The process is as highly sequential 
like other greedy heuristics. 
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for any integer A; > 1, as a subroutine (in [Nanl4] this was called shortest paths diameter 
reduction^). In this paper, we show that this subroutine can be replaced by the construction 
of an o(l))-hop set of size 

Our hop set construction is based on computing clusters which is the basic subroutine 
of Thorup and Zwick’s distance oracles [TZ05] and spanners [TZ05, TZ06]. It builds on 
a line of work in dynamic graph algorithms. In [Ber09], Bernstein showed that clusters 
can be used to construct an ( 71 °*-^^ o(l))-hop set of size Later in [HKN14], we 

showed that the same kind of hop set can be constructed by using a structure similar to 
clusters while restricting the shortest-path trees involved to some small distance and use 
such a construction in the dynamic (more precisely, decremental) setting. The construction 
is, however, fairly complicated and heavily relies on randomization. In this paper, we build 
on the same idea, i.e., we construct a hop set using bounded-distance clusters. However, our 
construction is significantly simplified, to the point that we can treat the cluster computation 
as a black box. This makes it easy to apply on distributed networks and to derandomize. 
To this end, we derandomize the construction simply by invoking the deterministic clusters 
construction of Roditty, Thorup, and Zwick [RTZ05] and observe that it can be implemented 
efficiently on distributed networks^. We note that it might be possible to use Cohen’s hop 
set construction instead. However, Cohen’s construction heavily relies on randomness and 
derandomizing it seems significantly more difficult. 

Updates. After the preliminary version of this paper appeared ([HKN16]), [BKK+16] 
showed that the term in our bounds can be eliminated. Elkin and Neiman showed the 
first construction of sparse hop sets with a constant number of hops [ENlGa], and showed an 
application in computing approximate shortest paths from s sources. In particular, using 
our hop set and a modification of the framework in [Nanl4] and this paper^, this problem 
can be solved in 0((sn)^/^"’“°^^^ + rounds. Elkin and Neiman showed a hop set which 

can be used to reduce the bound to 0((sn)^/^ -f D) when s = In [EN16b], they also 

showed further applications of hop set in the distributed construction of routing schemes. R 
was pointed out by Patt-Shamir (see [TselS]) that using our algorithm as a black box, one 
can simplify and obtain improved running time in the construction of compact routing tables 
in [LP15]. (On the other hand, we note that our construction is based on many ideas from 
[LP15].) Our hop set construction also found applications in metric-tree embeddings [FL16]. 

Organization. We start by introducing notation and the main dehnition in Section 2. 
Then in Section 3 we explain the deterministic hop set construction in, which is based on 
a variation of Thorup and Zwick’s clusters [TZ05] In Section 4, we give our main result, 
namely the (1 + o(l))-approximation + U^'*“°^^^)-time algorithm. In that section 

we explain the deterministic process for selecting centers mentioned above, as well as how to 
implement the hop set construction in the distributed setting. Finally, our remaining results 
are proved in Section 5. 

®This follows the notion of shortest paths diameter used earlier in distributed computing [KP08] 

^We note that Thorup-Zwick’s distance oracles and spanners were considered before in the distributed 
setting (e.g. [LP15, DSDP15]) 

®More precisely, following Elkin and Neiman [EN16a], one constructs an overlay network of size y/sn 
instead of y/n as done in this paper. 
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2 Preliminaries 


2.1 Notation 


In this paper we consider weighted undirected graphs. For a set of edges E, the weight of 
each edge (tt, v) ^ E is given by a function w{u, v, E). If {u, v) ^ E, we set w{u, v, E) = oo. 
For a graph G = {V,E), we dehne w{u,v,G) = w{u,v,E). Whenever we dehne a set of 
edges E as the union of two sets of edges EiU E 2 , we set the weight of every edge {u, v) G E 
to w{u, V, E) = min(rt;(tt, v, Ei),w{u, v, E 2 )). We denote the weight of a path vr in a graph 
G by w{tt, G) and the number of edges of vr by |7r|. 

Given a graph G = {V, E) and a set of edges F C I/^, we define G U F as the graph that 
has V as its set of nodes and F U F as its set of edges. The weight of each edge (u, v) is 
given by w(u, v,G U E) = w{u, v, E U E) = min(rt;(tt, v, E),w{u, v, F)). 

We denote the distance between two nodes u and v, i.e., the weight of the shortest path 
between u and u, by d{u, v, G). We define the distance between a node u and a set of nodes 
A C V hy d{u, A, G) = min^g^i d{u, v, G). For every pair of nodes u and v we define distance 
up to range R by 


d{u, V, R, G) 


d{u,v,G) iid{u,v,G)<R 
00 otherwise. 


and for a node v and set of nodes A CV hy d{u, A, R, G) = d{u, v, R, G). 

For any positive integer h and any nodes u and v, we dehne the bounded-hop distance 
between u and u, denoted by d^{u,v,G), as the weight of the shortest among all u-v 
paths containing at most h edges. More precisely, let be the set of all paths 

between u and v such that each path vr G n^(u, u) contains at most h edges. Then, 
d^{u, V, G) = min^gjjh^^ w{ 7 r, G) if n^(u, v) / 0, and d^{u, v,G) = 00 otherwise. 

We denote the hop-distance between two nodes u and v, i.e., the distance between u and 
V when we treat G as an unweighted graph, by hop{u, v, G). The hop diameter of graph G 
is dehned as D{G) = max^ „gy(G) hop{u, v, G). When G is clear from the context, we use D 
instead of D{G). We note that this is different from the weighted diameter, which is defined 
as WD{G) = max„ „gy(-G) d{u,v,G). Throughout this paper we use “diameter” to refer to 
the hop diameter (as it is typically done in the literature). 

Given any graph G = {V, E), any integer h, and e > 0, we say that a set of weighted 
edges F is an {h, e)-hop set of G if 


d{u, V, G) < d^{u, V, F) < (1 + e)d{u, v, G ), 


where H = {V, E U F). In this paper we are only interested in o(l))-hop sets of size 

0 {n^^°A)y We refer to them simply as “hop sets” (without specifying parameters). 


2.2 The CONGEST Model and the Problem 

A network of processors is modeled by an undirected weighted n-node m-edge graph G, 
where nodes model the processors and edges model the bounded-bandwidth links between 
the processors. Nodes are assumed to have unique IDs in the range {1,2,..., poly(n)} and 
infinite computational power. We denote by A the number of bits used to represent each ID, 
i.e., A = O(logn). Each node has limited topological knowledge; in particular, every node u 
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only knows the IDs of each neighbor v and w{u, v, G). As in [Nanl4], we assume that edge 
weights are integers in {1,2,... ,W}, and W = poly log n. This is a typical assumption as it 
allows to encode the weight of an edge in one (or a constant number of) messages. 

The distributed communication is performed in rounds. At the beginning of each 
round, all nodes wake up simultaneously. Each node u then sends an arbitrary message of 
B = O(logn) bits through each edge {u,v), and the message will arrive at node v at the 
end of the round. The running time of a distributed algorithm is the worst-case number of 
rounds needed to finish a task. It is typically analyzed based on n (the number of nodes) 
and D (the network diameter). 

Definition 2.1 (Single-Source Shortest Paths (SSSP)). In the single-source shortest paths 
problem (SSSP), we are given a weighted network G and a source node s (the ID of s is 
known to every node). We want to find the distance between s and every node v in G, denoted 
by d{s,v,G). In particular, we want every node v to know the value of d{s,v,G). 

Recovering shortest paths. We note that although we define the problem to be com¬ 
puting the distances, we can easily recover the shortest paths in the sense that every node 
u knows its neighbor v that is in the shortest path between u and s. This is because our 
algorithm computes the distance approximation that satisfies the following property 

every node u / s has a neighbor v such that d'{s, v) + w{u, v, G) < d'{s, u), (1) 

where d\s,v) is the approximate distance between s and v. For any distance approximation 
d' that satisfies Equation (1), we can recover the shortest paths by assigning v as the 
intermediate neighbor of u in the shortest paths between u and s. 

It can be easily checked throughout that the approximate distance function that we 
compute satisfies Equation (1). This is simply because our algorithm always rounds an 
edge weight w{u,v,G) up to some weight w'{u,v), and computes the approximate distances 
based on this rounded edge weight. For this reason, we can focus only on computing the 
approximate distances in this paper. 

2.3 Toolkit 

In the following we review known results used for designing our algorithm. The first is a 
weight-rounding technique [Coh98, Zwi02, Ber09, M^dlO, Berl6, Nanl4] that we repeatedly 
use to scale down edge weights at the cost of approximation. As we will use this technique 
repeatedly, we give a proof in Appendix A for completeness 

Lemma 2.2 ([Nanl4]). Let h >1 and let G be a graph with integer edge weights from 1 

to W. For every integer 0 < i < [nVFj, set Pi = ^ and let Gi be the graph with the same 

nodes and edges as G and weight w{u,v,Gi) = every edge {u,v). Then for all 

pairs of nodes u and v and every 0 < i < [nITj 

Pi ■ d{u,v,Gi) > d{u,v,G). (2) 

Moreover, i/2® < d^{u,v,G) < 2®+^, then 

d{u,v,Gi) < {1 + 2/e)h and (3) 

Pi ■ d{u,v,Gi) < {1 + e)d^{u,v,G). (4) 
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An important subroutine in our algorithm is a procedure for solving the source detection 
problem [LP13] in which we want to find the a nearest “sources” in a set S for every node u, 
given that they are of distance at most 7 from u. Ties are broken lexicographically. The 
following definition is from [LP15] 

Definition 2.3 ((S, 7, (T)-detection). Consider a graph G, a set of “sources” S C V{G), 
and parameters 7, cr G N. For any node u let L{u, S, 7, a, G) denote the list resulting from 
ordering the set {{d{u,v,G),v)\v G 5 A d{u,v,G) < 7} lexicographically in ascending order, 
i.e., where 

{{d{u,v,G),v) < {d{u,v',G),v') 

{d{u, V, G) < d{u, v', G)) V {d{u, v, G) = d{u, v', G) f\v < v'), 

and restricting the resulting list to the first a entries. The goal of the {8,^, a)-detection 
problem is to compute L{u, S,'y,a,G) for every node u G y{G). In the distributed setting we 
assume that eaeh node knows 7, a, and whether it is in S or not and the goal is that every 
node u £ V(G) knows its list L{u, S, 7, cr, G), 

Lenzen and Peleg designed a source detection algorithm for unweighted networks [LP13]. 
One can also run the algorithm on weighted networks, following [LP15, proof of Theorem 
3.3], by simulating each edge of some weight L with an unweighted path of length L. Note 
that nodes in the paths added in this way are never sources. 

Theorem 2.4 ([LP13]). In the CONGEST model there is an algorithm for solving the 
{S, 7, a)-detection problem in min (7, WD) + min (cr, | 5 |) rounds on weighted networks, where 
WD is the weighted diameter. 

We also use another source detection algorithm: Roditty, Thorup, and Zwick [RTZ05] 
also solve a variant of the source detection problem with 7 = 00 in their centralized algorithm 
for computing distances oracles and spanners deterministically. They reduce the source 
detection problem to a sequence of single-source shortest paths computations on graphs with 
some additional nodes and edges. Their algorithm can easily be generalized to arbitrary 7. 

Theorem 2.5 (implicit in [RTZ05]). In the sequential model, the {S,^, a)-detection problem 
in directed graphs with positive edge weights can be solved by performing a single-source 
shortest paths computations up to distance 7 on graphs with at most 0{n) nodes and 0{m) 
edges. 

Here we give a short sketch of the algorithm. The algorithm works in phases, in the j-th 
phase it finds for every node v £ V the j-th source node of S of distance at most 7 (with 
ties broken arbitrarily). In the first phase we find for each node u G R its closest source 
node of distance at most 7 as follows: We add an artificial super-source s* to G with a 
0-length edge from every node in S to s*. Then we perform a single-source shortest path 
computation with source s* up to distance at most 7, resulting in a shortest-path tree rooted 
at s*. For each node u G R its ancestor in this tree that is a child of s* is its closest source. 
We use Uj{v) to denote the (up to) j closest sources for v that the algorithm found in the 
first j phases for 1 < j < cr. In phase j > 1 we construct a graph Gj with the following 
node set: We add to R a new super-source node s* and a copy u of every node u £ Ai. The 
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graph Gj contains the following edges: For each new node u it contains a 0-length edge to 
s*. Additionally for each edge {y,v) G E, we add the edge (with length w{y,v,G)) to Gj if 
Uj-i{v) = Uj-i{y) and otherwise we add the edge {y, u) with length w{y, v, G) -\- d{v, u, G), 
where u is the source in Uj-i{v) \ Uj-i{y) with smallest distance from v (with ties broken 
lexicographically). This edge is a “shortcut” that replaces all possible paths from y through 
the edge (y, f) to a source not in Uj-i{y) by an edge to the node u representing the closest 
such source u. Then we perform a single-source shortest path computation with source s* 
up to distance at most 7 in Gj. The j-th closest source for each node v is its ancestor that 
is a child of s* in the resulting shortest-path tree. Note that each graph Gj consists of at 
most n -|- |5| -|- 1 nodes and m -|- |5| edges and, as there are a phases, the algorithm requires 
a single-source shortest path computations. 

Another subproblem arising in our algorithm is the computation of ruling sets The 
following dehnition was adapted from the recent survey of Barenboim and Elkin [BE13, 
Section 9.2]. 

Definition 2.6 (Ruling Set). For a (possibly weighted) graph G, a subset U C V{G) of 
nodes, and a pair of positive integers a and j3, a set T CU is an {a, fd)-ruling set for U in 
G if 

1. for every pair of distinct nodes u,v €T, it holds that d{u,v,G) > a, and 

2. for every node u G U \ T, there exists a “ruling” node v € T, such that d{u, v, G) < (3. 

The classic result of Goldberg et al. [GPS 88 ] shows that in the distributed setting, for 
any c > 1, we can compute a (c, cA)-ruling set deterministically in 0(c log n) rounds, where 
A is the number of bits used to represent each ID in the network. Since it was not explicitly 
stated that this algorithm works in the CONGEST model, we sketch an implementation of 
this algorithm in Appendix C (see [BE13, Ghapter 9.2] and [PelOO, Ghapter 22] for more 
detailed algorithm and analysis). 

Theorem 2.7 (implicit in [GPS 88 ]). In the CONGEST model there is an algorithm that, for 
every c> 1, computes a {c, cX)-ruling set in O(clogn) rounds, where A is the number of bits 
used to represent each ID in the network. 

3 Deterministic Hop Set Construction 

In this section we present an deterministic algorithm for constructing an o(l))-hop set. 

In Section 3.2 we give an algorithm that computes a set of edges F that reduces the number 
of hops between all pairs of nodes in the following way for some fixed A > 1: if the shortest 
path has weight R, then using the edges of F, we can find a path with 0{R/A) edges at the 
cost of a multiplicative error of o(l) and an additive error of Our algorithm obtains 

F by computing the clusters of the graph. We explain clusters and their computation in 
Section 3.1. In Section 3.3, we show how to repeatedly apply the first algorithm for different 
edge weight modihcations to obtain a set of edges F providing the following stronger hop 
reduction for all pairs of nodes: if the shortest path has h hops, then, using the edges of F, 
we can hnd a path with 0{h/A) hops at the cost of a multiplicative error of o(l) and no 
additive error. Finally, in Section 3.4, we obtain the hop set by repeatedly applying the hop 
reduction. 


10 


3.1 Deterministic Clusters 

The basis of our hop set construction is a structure called cluster introduced by Thorup and 
Zwick [TZ05] who used it, e.g., to construct distance oracles [TZ05] and spanners [TZ06] of 
small size. Consider an integer p such that 2 < p < log n and a hierarchy of sets of nodes 
{Ai)Q<i<p such that Aq = V, Ap = 0, and Aq^ Ai^ ... ^ Ap. We say that a node v has 
priority i if veA,\ A+i (for 0 < i < p — 1). For every node v € V the cluster of v in G is 
defined as 

C{v, A,G) = {u \ d{u, V, G) < d{u, ^i+i, G)} , 
where i is the priority of v. 

To get efficient algorithms in the models of computation we want to consider, we do not 
know how to efficiently compute clusters as defined above, but will compute the following 
restricted clusters up to distance R instead. For any node v of priority i, let 

C{v, A, R, G) = {u £ V \ d{u, V, G) < d{u, ^j+i, G) and d{u, v, G) < R} . 

3.1.1 Computing Priorities A 

The performance of our algorithms relies on the total size of the clusters, which in turn 
depends on how we compute nodes’ priorities. If randomization is allowed, we can use the 
following algorithm due to Thorup and Zwick [TZ05, TZ06]: set Aq = V and Ap = 0, and 
for 1 < i < p — 1 obtain Ai by picking each node from Ai-i with probability (Inn/n)^/^. 
It can be argued that for A = (^i)o<i<p the size of all clusters, i.e., J2veV |G(u,AG)|,is 
in expectation. We now explain how to deterministically compute the priorities 
of nodes (given by a hierarchy of sets of nodes A = (^i)o<i<p) such that the total size of the 
resulting clusters is 0{pn^~^^^^). 

Thorup and Zwick [TZ05] introduced the notion of bunches to analyze the sizes of clusters. 
For every node u £ V, we define the bunch and, for every 0 < i < p — 1, the bunch, both 
restricted to distance R, as follows: 

Bi{u, A, R, G) = {v £ Ai \ AiJ^i \ d{u, v, G) < d{u, ^i+i, G) and d{u, v, G) < R} 
B{u,A,R,G)= [J Bi{u,A,R,G) 

0<i<p-l 

The crucial insight is that v £ B{u, A, R, G) if and only if u G C{v, A, R, G). Thus, it suffices 
to choose a hierarchy of sets Ai such that \Bi{u,A,R,G)\ < 0{n^^P) for every u £V and 
0 < i < p — 1. 

Our algorithm for deterministically computing this hierarchy of sets of nodes follows the 
main idea of Roditty, Thorup, and Zwick [RTZ05]. Its pseudocode is given in Procedure 1. 
As a subroutine this algorithm solves a weighted source detection problem, i.e., for suitable 
parameters q, A, and R, it computes for every node v the set L{v, A, R, q, G) containing the 
q nodes of A that are closest to u - up to distance R; if there are fewer than q nodes of 
A in distance R to u, L{v, A, R, q, G) contains all of them. Our algorithm for constructing 
the hierarchy of sets (Aj)o<i<p is as follows. We set Aq = V and Ap = % and to construct 
the set Aj+i given the set Ai for t) < i < p — 2 we first find for each node v £ V the set 
L{v, Ai, R,q,G) using a source detection algorithm. Then we view the collection of sets 
{L{v, Ai, R, q, G)}y£v as an instance of the hitting set problem over the universe Ai, where 
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we want to find a set ^j+i C Ai of minimal size such that each set L{v, Ai, R, q, G) contains 
at least one node of ^j+i. We let Aj+i be a hitting set of approximately minimum size® 
determined by the deterministic greedy heuristic (always adding the element “hitting” the 
largest number of “un-hit” sets) to produce a hitting set within a factor of 1 -f Inn of the 
optimum [Joh74, ADP80]. In the following we prove the desired bound on the size of the 
bunches, which essentially requires us to argue that setting q = 0 (n^/^) is sufficient. 


Procedure 1: Priorities(G, p, R) 


Input: Weighted graph G = (V,E) with positive integer edge weights, number of 
priorities p > 2, distance range R> 1 
Output: Hierarchy of sets (Ai)o<i<p 


1 

2 

3 

4 

5 

6 
7 


|'2n^/^ln (3n)(l-f Inn)] 

Aq •(— V 

for i = 0 to p — 2 do 

Compute L{v, Ai, R,q,G) for every node v G V using a source detection algorithm 

foreach v gV do 

1^ if \L{v, Ai, q,R,G)\ = q then C C U {L{v, Ai, R, q, G)} 
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Compute an approximately minimum hitting set Hj+i C Ai using a greedy 
heuristic 


9 Ap i — 0 

10 return A = (Hj)o<j<p 


Lemma 3.1 (Implicit in [RTZ05]). Given a finite collection of sets C = {Si,..., Sq} over a 
universe U and a parameter x > 1 such that |Si| > 2xln3g for all 1 < i < q, there exists a 
hitting set T C U of size \T\ < \U\/x such that T n S* / 0 for all 1 < i < q. 

We give a proof of Lemma 3.1 in Appendix B for completeness. 

Lemma 3.2. Procedure 1 computes a hierarchy A = (Aj)o<i<p of sets of nodes such that 

Y.C{u,A,B^,a)=Y. B{u,A,R,G) =0(pn^+^/P). 

u&V ueV 

Proof. We first show by induction that | Aj| < for allO<i<p—1. Iff = 0 the claim 

is trivially true because we set Aq = V. We now assume that |Aj| < and argue that 

\Ai\ < Our algorithm approximately computes a minimum hitting set of the 

collection of sets C containing each set L{v, Ai, q, R, G) of size q. By Lemma 3.1 we know 
that there is a hitting set A' for C of size at most |Aj|/(n^/P(l -f Inn)) < n^“*/^/(l -f Inn) 
and thus the minimum hitting set Aj+i computed by the greedy heuristic has size at most 
(1 -|- lnn)|A'| < Note that each set L{v, Ai, q, R, G) might have been empty and in 

this case the algorithm would have computed Aj+i = 0, the trivial hitting set. 

®In principle, local computation is free in the models considered in this paper and we could thus compute 
a minimum hitting set exactly. However, we decided to present the algorithm in a way that avoids solving 
NP-complete problems by local compuation. 
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We now show that for every node u & V and every 0 < i < p — 1, Bi{u,A,R,G) < q = 
log n), which immediately implies the desired bound on the total size of the bunches 
and clusters. We argue by a simple case distinction that Bi{u, A, R, G) C L{u, Ai, R, q, G) 
and thus, by the dehnition of the set of the q closest nodes in Ai, \Bi{v,A,R,G)\ < 
\L{u, Ai, R,q,G)\ < q. If \L{u,Ai,R,q,G)\ < q, then clearly L{u, Ai, R,q,G) = {v £ Ai \ 
d{u,v,G) < R} ^ Bi{v,A,R,G). Otherwise we have \L{u, Ai, R, q,G)\ = q and, as the 
algorithm computed a suitable hitting set, we have Bi{u, A, R, G) C L{u, Ai, R, q,G). □ 

As an alternative to the algorithm proposed above, the hitting sets can also by computed 
with the deterministic algorithm of Roditty, Thorup, and, Zwick [RTZ05] which produces 
so-called “early hitting sets”. For this algorithm we have to set q = log n), and obtain 

slightly smaller clusters of total size 0{p'n}~^^^^). However, since the logarithmic factors are 
negligible for our purpose, we have decided to present the simpler algorithm above. 

3.1.2 Computing Clusters 

Given the priorities of the nodes, the clusters can be computed by hnding a shortest-path 
tree that is “pruned” at nodes whose distance to the root is more than (or equal to) their 
distance to nodes of higher priority than the root. In the pseudocode of Procedure 2 we 
formulate this algorithm as a variant of weighted breadth-first search. We will not analyze 
the performance of this algorithm at this point since it depends on the models of computation 
that simulate it (see Section 3.1.1 and Section 5 for implementations of the algorithm). 

We summarize our guarantees with the following theorem. 

Theorem 3.3. Given a weighted graph G with positive integer edge weights and parameters 
p and R, Procedure 2 computes a hierarchy of sets A = (Aj)o<i<p, where V = Aq C Ai C 
••• ^ Ap = %, such that I ^)l ~ 0{pn^~^^^P). R also computes for every 

node V the set C{v,A,R,G) and for each node w e C{v,A,R,G) the value of d{v,w,G). 

3.2 Hop Reduction with Additive Error 

Consider the following algorithm for computing a set of edges F. First, deterministically 
compute clusters with p = [-yiog n/ log (9/e)J priorities (determined by a hierarchy of sets 
A = {Ai)o<i<p) up to distance R = Let F be the set containing an edge for every 

pair (u, v) G such that v G C{u, A, R, G) and set the weight of such an edge {u, v) £ F 
to w{u, V, F) = d{u, V, G), where the distance is returned by the algorithm for computing 
the clusters. Procedure 3 gives the pseudocode of this algorithm. 

Lemma 3.4. Let F be the set of edges computed by Procedure 3 for a weighted graph 

G = {V,E) and parameters A > 1 and 0 < e < 1. Then F has size , where 

p = [a/ (log n) / (log (9/e))J, and in the graph H = GU F, for every pair of nodes u and v, 
we have 

^{p+i)\d(u,v,G)/lF\ G) < (1 + e)d{u, V, G) + en^/?’A/(p + 2), 

i.e., there is a path tt' in H of weight w{'k', H) < {l + e)d{u,v,G) + en^/Pl^/{p + 2) consisting 
of \tt'\ < {p + l)\d{u,v,G)/A] edges. 
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Procedure 2: Clusters(G, p, R) 

Input: Weighted graph G = {V,E) with positive integer edge weights, number of 
priorities p > 2, distance range R> 1 
Output: Clusters of G as specified in Theorem 3.3 

1 {Ai)o<i<p ^ Priorities(G, p, R) 

2 For each 1 < i < p — 1 and every node v € V compute d{v, Ai, R, G) 


3 

4 

5 

6 

7 

8 
9 


10 

11 

12 

13 

14 


foreach it £ F do // Compute cluster of every node 

// Initialization 

Let i be the priority of u, i.e., u £ Ai \ Ai+i 
foreach v £ V do 5{u, r) oo 
6{u, li) •(— 0 

C{u) £- 0 

// Iteratively add nodes to cluster 
for L = 0 to ii do 

foreach node v with 5{u,v) = L do 

// Check if v joins cluster of u at current level 

if 5{u,v) < d{v,Aij^i,R,G) then 
C{u) £- C{u) U {r} 

foreach {v,w) £ E do // Update neighbors of v 

6\u, w) £- {w{v, w, G) + 6{u, v)) 

if (5' (li, It) < (5(tt, id) then 6{u,w)-<^6'{u,w) 


15 return ((^(r),(i(r, •))„gy 


Procedure 3: HopReductionAdditiveError(G, A, e) 

Input: Graph G = (V, E) with non-negative integer edge weights, A>1, 0<e<l 
Output: Hop-reducing set of edges P C as specified in Lemma 3.4 


/ log I 

V iog(9A)J 

j^i/PA 


1 p 

2 R 

3 0 

4 {G{v),5{v,-))^(ZV 


Clusters(G, p, R) 

5 foreach u £ F do 

6 foreach v £ C{u) do 

7 F F U {{u, r)} 

8 w{u,v, E) = 6{u,v) 


9 return F 
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We devote the rest of this section to proving Lemma 3.4. The bound on the size of F 
immediately follows from Theorem 3.3. We analyze the hop-reducing properties of F by 
showing the following. Let vr be a shortest path from u to n in G. Then there is a node w 
on TT and a path tt' from u to w \n H = GU F with the following properties: 

(1) The distance from tt to tc in G is at least A. 

(2) The path vr' consists of at most p edges of F and at most one edge of G. 

(3) The ratio between the weight of tt' in H and the distance from u to re in G is at most 
(1 -|- e) it w ^ V and if rc = n then the weight of tt' in H is at most /3 (for some j3 that 
we set later). 

When we go from u to w using the path tt' instead of the subpath of vr we are using a 
“shortcut” of at most p + 1 hops that brings us closer to u by a distance of at least A at the 
cost of some approximation. Conditions (1) and (2) guarantee that by repeatedly applying 
this shortcutting we can find a path tt" from u to u that has at most {p + 1) \d{u, v, G)/A] 
hops (as we replace subpaths of tt with weight at least A by paths with at most p+1 hops). 
Condition (3) guarantees that the multiplicative error introduced by using the shortcut is 
at most 1 + e, except possibly for the last time such a shortcut is used, where we allow an 
additive error of p. We will show that we can guarantee a value of /3 that is bounded by 
en^/pj^p -|- 2). This type of analysis has been used before by Thorup and Zwick [TZ06] to 
obtain a spanner for unweighted graphs defined from the partial shortest path trees of the 
clusters, but without considering the hop-reduction aspect. Bernstein [Ber09] also used a 
similar analysis to obtain a hop set for weighted graphs using clusters with full distance 
range. We previously used this type of analysis to obtain a randomized hop set which is not 
based on clusters, but on a similar notion [HKN14]. 

To carry out the analysis as explained above we define a value r* for every 0 < i < p — 1 
as follows: 


ro = A 

(4 -|- 2e) J 2 o<j<i-i Pj 

n = --. 

e 

The intuition is that a node u of priority i tries to take an edge of F to shortcut the way to v 
by at least r^. If this fails it will find an edge in F going to a node v' of higher priority. Thus, 
to fulfill Condition (3), v' has to try and shortcut even more “aggressively”. Consequently, 
the values of rj grow exponentially with the priority i. 

We have chosen the range of the clusters large enough such that nodes of the highest 
priority will always find the desired shortcut edge in F. We will show that the additive error 
incurred by this strategy is at most 


j3= 2ri. 

0<i<p-l 

This value can in turn be bounded as follows. 
Lemma 3.5. (3 < en i/pA/(p + 2). 
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Proof. We first show that, for all 0 < z < p — 1, J2o<j<i''^j ^ 7*A/eh The proof is by 
induction on i. For z = 0 we have ro = A = 7^A by the definition of ro and for z > 1 we use 
the inequality e < 1 and the induction hypothesis as follows: 

2^ rj +ri= rj + -- 

o<i<* o<i<i-i o<i<i-i ^ 

^ So<j<i-i Tj 6 So<j<i-i ^ 7 1]o<j<i-i ^ 7 • 7^~^A ^ 7^A 

~ e € ~ e ~ € ■ e*“^ ~ e* 

Using this inequality and the fact that {2p + 7)7^“^ < 9^ for all p > 0 we get 

{p + 2)/3 _ (p + 2) ^o<i<p-i < (2p + 4)7P ^A ^ ^ ri^/PA 

e € ~ eP ~ eP ~ 

The last inequality holds as by our choice of p = [\/logn/log(9/e)J: 

^log n , /fi / \ _ y^log (9/e) , 

(^Q/e)P = < 2n/'°s(9/'=) '^°® = 2Viog"V*°s(®A) = 2 ” = 2^°s"-(i/p) = n^/P . 

□ 

In the following we fix some values of e and A and let F denote the set of edges computed 
by Procedure 3. We now show that F has a certain structural property before we carry out 
the hop-reduction proof. 

Lemma 3.6. Let u and v be nodes such that u G Ai\Ai+i has priority i and d{u, v, G) < ri. 
Either 

(1) F contains an edge {u,v) of weight w{u,v,F) = d{u,v,G) or 

(2) F contains an edge {u,v') to a node v' G Ai+i of priority j > z-|-l of weight w{u,v\F) < 

2ri. 

Proof. Consider first the case v G C{u,A,R,G). Then F contains the edge {u,v) of weight 
w{u, V, F) = d{u, V, G). 

Consider now the case v ^ C{u,A,R,G). Note that by the dehnition of /3 we have 
Pi F fd /2 < fd and by Lemma 3.5 we have /? < zz^/^A. As the algorithm sets R = zzVpA we 
have Pi < Rhy Lemma 3.5 and thus (i(zz, v, G) < R. From the dehnition of C{u, A, R, G) 
it now follows that d{v,u,G) > d{v, Ai^i,G). Thus there exists some node vi G Aj+i of 
priority pi > z -|- 1 such that d{v,vi,G) < d{u,v,G). By the triangle inequality we get 
d{u, vi, G) < d(u, V, G) -|- d{v, vi,G) < 2d{u, v, G) < 2rj < R. If zz G C{vi,A, R, G) then we 
are done as F contains the edge (zz, z;i) of weight w{u, vi, F) = d{u, vi,G) < 2rj. Otherwise 
it follows from the dehnition of C{vi,A, R, G) that there is some node V 2 G of priority 

P 2 >Pi + l >z-|-l such that d{u,V 2 ,G) < d{u,vi,G) < 2rj < R. By repeating the 
argument above we want to hnd some node vj G Aj+i of priority pj > z -|- 1 such that 
d{u, Vj, G) < d{u, V, G) < 2ri. As for every node v' G Ap_i of priority p — 1, C{v', A, R, G) 
contains all nodes that are at distance at most R from F in G this repeated argument stops 
eventually and we hnd such a node. □ 
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To finish the proof of Lemma 3.4 we show in the next lemma that F has the properties 
we demanded, i.e., in the shortcut graph H which consists of G and the additional edges of 
F, we can approximate shortest paths using a reduced number of hops. 

Lemma 3.7. For every pair of nodes u,v & V such that d{u, v, G) < oo, the graph H = GUF 
contains a path n' from u to v of weight w{'k\H) < (1 + €)d{u,v,G) + ft consisting of 
K'l < (p + 1) |'d(u, u, G)/A] edges. 

Proof The proof is by induction on the distance from u to u in G. The claim is trivially 
true for the base case d{u, v,G) = 0 in which u = v. Thus, we only need to consider the 
induction step in which d{u, v, G) > 1. 

Let vr denote the shortest path from tt to u in G. We now define a sequence of nodes 
uo,ui,... ,ui. For every 0 < j < /, we denote by pj the priority of Uj. We set uq = u and, 
given Uj, we define rtj+i as follows. Let w be the node on vr closest to v that is at distance 
at most rp. from Uj in G (this node might be v itself). If H contains the edge {uj,w) we 
stop (and set I = j). Otherwise we know by Lemma 3.6 that H contains an edge {uj,u') 
to a node u' of priority at least pj + 1. In that case we set Uj+i = u'. We know further by 
Lemma 3.6 that d{uj, Uj+i, G) < Having defined the sequence uo,ui,... ,ui, we 

denote by w the node on vr closest to v that is at distance at most rp^ from ui in G (again, 
this node might be v itself). Figure 1 illustrates the definition of this sequence. 



Figure 1: Schematic illustration of the dehnition of the sequence of nodes uq, ui,... ,ui,w. 
The bottom line represents the shortest path from u to v. The thick, blue edges are the 
edges of F used to shorten the distance to v. The dashed, blue edges are not contained in F 
and imply the existence of edges to nodes of increasing priority. The dotted lines indicate 
repetitions that are omitted in the picture. 

Consider first the case that w = v. Let vr' denote the path (uq, ... ,ui, w). This path 
has at most p hops and since d{u,v,G) > 1 we trivially have p < {p + l)\d{u,v,G)/IiF\. 
Furthermore we can bound the weight of tt' as follows: 

w{tt',H)= w{uj,Uj+i,H) +w{ui,w,H) < Y d{uj,Uj+i,G) + d{ui,w,G) 

o<j<i-i 
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< E 2 r, 

0<j<p-l 

= /S < (1 + e)d{u, v,G) + f3. 

Consider now the case w ^ v. Let w' be the neighbor of u) on vr (that in G is closer to v 
than w is). We will define the path vr' from tt to n as the concatenation of two paths vri and 
7r2. Let TTi be the path (no, ... ,ui,'w, w'). We will define the path 7r2 from w' to v later on. 
Note that vri consists of |pi| < p + 1 hops. We will now show that 

w{'Ki, H) < {1-\-e)d{u,w',G). (5) 

In order to get this bound we will need some auxiliary inequalities. By Lemma 3.6 we have, 
for all 0 < j < / — 1, 

d{uj,Uj+i,G) < (6) 

and by the definition of Vp^ we have 

= (4 + 2 e) ^ Tj . ( 7 ) 

Remember that w is the node on vr closest to v that is at distance at most rp. from ui in G. 
Since the neighbor w' of w is closer to v then w is, this definition of w guarantees that 
d{ui,w' , G) > Tp-. As d{ui,w' , G) < d{ui,w, G) + d{w, w' , G) by the triangle inequality, we 
have 

d{ui,w,G) + d{w,w',G) > rp. . (8) 

By the triangle inequality we also have 

d{ui,w,G) < E d{uj,Ujj^i, G) + d{u, w, G). 


and thus 


d{ui,w,G) - E d{uj,Uj+i,G) < d{u,w,G) . 


(9) 


We now obtain Inequality (5) as follows: 
w{7ri,H) = E w{uj,Uj+i, H) + w{ui^ w, H) + w{w, w , H) 

= ^ d{uj,Ujj^i,G) + d{ui,w,G) + d{w,w',G) 

= (2 + e) E d{uj,Uj+i,G) + d{ui,w,G) + d{w,w',G) - {I + e) ^ d{uj,Uj+i,G) 

o<j<«-i 

( 6 ) 

<(2 + e) ^ 2rp.^^_i +d{ui,w,G) + d{w,w\G) - {1 +e) ^ d{uj,Uj+i,G) 

o<j<«-i 0<j<Z-l 

^ (2 + e) E 2rj-\-d(ui,w,G^-\-d{w,w iG^ — e) ^ Uj+i, G) 

0<j<pi-l 0<j<l-l 

= erp^+d{ui,w,G) + d{w,w ,G) - {1 +e) ^ d{uj,Uj+i,G) 

o<i<z-i 
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< 


e{d{ui,w,G) + d{w,w',G)) + d{ui,w,G) + d{w,w',G) - {1 + e) ^ d{uj,Uj+i, 


= (1 + e) I d{ui,w, G) - ^ d{uj,Uj+i, G) j + (1 + e)d{w, w', G) 

\ 0<j<l-l ) 

(9) 

< (1 + e)d{u, w, G) + (1 + e)d{w, w', G) 

< (1 + e){d{u, w, G) + d{w, w', G)) = (1 + e)d{u, w', G) 


Note that d{w\ v, G) < d{u, v, G). Therefore we may apply the induction hypothesis on 
w' and get that the graph H contains a path it 2 of weight w(7T2, H) < (1 + e)d{w', v,G) + 
that has |7r2| < (p + l)\d{w',v,G) / A~\ hops. Let tt' denote the concatenation of vri and 712 - 
Then tt' is a path from tt to u in Lf of weight 


wiir', H) = w{'Ki,H) + w{'K2,H) 

< (1 + e)d{u, w' , G) + (1 + e)d{w', v,G) + P 
= (1 + e){d{u, w', G) + d{w', v, G)) + (5 
= (1 + e)d{u,v, G) + fd . 


It remains to bound the number of hops of vr'. To get the desired bound we first show 
that d{u, w', G) > A. By the triangle inequality we have 

d{ui,w',G)< ^ d{uj,Uj+i,G) + d{u,w',G). 

As argued above, we have d{ui,w',G) > rp- and J2o<j<i-id{uj,Uj+i,G) < J2o<j<pi-i‘^''^j- 
By the definition of rp^ we therefore get: 

d{u,w',G)>d{ui,w',G)— ^ d{uj,Uj+i,G) 

>rpi- = (4/e) ^ rj > ro = A . 

0<j<Pl-^ 0<j<p;-l 

Now that we know that d{u,w',G) > A, or equivalently d{u^w',G)/A > 1, we get the 
following for counting the number of hops of tt' by adding the number of hops of tti to the 
number of hops of 112 '■ 

Ivr'l = IttiI + |7r2| < p + 1 + (p + 1) \d{w' G)/A] = (p + 1)(1 + \d{w' G)/A]) 

= {V + '^)\'^ + d{w' ,v,G) / A~\ 

< (p+ l)\d{u,w',G)/A + d{w',v,G)/A)'\ 
= (p+ l)\{d{u,w ,G) + d{w ,v,G))/A~\ 

= {p+"^)\d{u,v,G)/A~\ 


Thus, tt' has the desired number of edges. 


□ 


3.3 Hop Reduction without Additive Error 

Consider a shortest path tt from u io v with h hops and weight i? > A. With the hop 
reduction of Procedure 3 we can compute a set of edges F such that in G U F we find a path 
from u to u with 0{R/A) hops of weight approximately R. We now use the weight-rounding 
technique of Lemma 2.2 and repeatedly apply this algorithm to obtain a set of edges F such 
that in G U F there is a path from u to u with 0{h/A) hops and weight approximately R. 
As in general R can only be upper-bounded by nW (where W is the maximum edge weight 
of G) and h can be upper-bounded by n, it is clear that the second type of hop reduction 
is more desirable. Additionally, if h is sufficiently larger than A, then the additive error 
inherent in the hop reduction of Procedure 3 can be counted as an additional multiplicative 
error of e. 

The second hop reduction algorithm roughly works as follows. For every possible distance 
range of the form 2F .. 2^^^ we scale down the edge weights of G by a certain factor and run 
the algorithm of Procedure 3 on the modihed graph Gj to compute a set of edges Fj. We 
then simply return the union of all these edge sets (with the weights scaled back to normal 
again). Procedure 4 shows the pseudocode of this algorithm. 


Procedure 4: HopReduction(G, A, /i, e, W) 

Input: Weighted graph G = (V,E) with integer edge weights from 1 to W, A > 1, 
h > 1, 0 < e < 1 

Output: Hop-reducing set of edges F C 1/^ as specified in Lemma 3.8 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 


e' e/6 
A' ^ 3A/e' 


F ^ 0 

for j = 0 to [log(nW)J do 
Pj •(— e'2^/h 

foreach (u, u) G F do w{u,v,Gj) •(— \w{u,v, E)/pj~\ 
Fj ^ HOPREDUCTIONADDITIVEERROR(Gj, A', e') 
foreach (u, v) G Fj do 
F ^ F U {(n, u)} 

w{u^ V, F) •(— min(t(;(u, v, Fj) ■ pj, w{u, v, F)) 


12 return F 


Lemma 3.8. Let F C V'^ be the set of edges computed by Procedure 4 for a weighted 
graph G = {V,E) and parameters A > 1, h > 1, and 0 < e < 1. Then E has size 
0{pn^^^^P log nW), where p = [a/( log n)/(log (54 /e))\, and if h > n^^PA/{p + 2), then in 
the graph H = G U E we have, for every pair of nodes u and v, 

u, F) < (1 + e)d^{u, V, G). 

Proof. Let u and u be a pair of nodes and set j = [log d^{u, v, G)J, i.e., 2^ < df‘{u, v, G) < 
2FI. Let TT be a shortest < h hop path in G, i.e., vr has weight w{'k, G) = d^{u, v, G) and 
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vr consists of |7r| < h hops. The algorithm sets e' = e/6 and uses a graph Gj which has 
the same nodes and edges as G, but in which every edge weight is first scaled down by a 
factor of pj = e'2Gh and then rounded up to the next integer. By Lemma 2.2 we have 
d{u, V, Gj) ■ Pj < (1 + e')d^{u, v, G) and d{u, v, Gj) < (1 + 2/e')h < 3hje'. 

Since d^{u, v, G) < nW the algorithm has computed a set of edges Fj. By Lemma 3.4, 
there is a path vr' in Hj = Gj U Fj of weight at most (1 + e')d{u, v, Gj) + e'n^G/^' j (p+ 2) and 
with at most \'k'\ < (p+ 1) \d{u, v, Gj)/A'] hops. Since d{u, v, Gj) < 3/i/e' and the algorithm 
sets A' = 3A/d we have 


W\ < (p +1) 


d(n, n, Gj) 

A^ 


< (P+ 1) 


3/i 

7a' 


= (p +1) 


<(p+l)(^ 


I (p I 1) < I {p+^)h ^ {v+l)h ^ h 



{p + 2)h 

A 


The algorithm “scales back” the edge weights of Fj when adding them to F and thus 
w{u, V, F) < w{u, V, Fj) ■ Pj. We now argue that H) < {1 + e')d^{u, v, G) by 

bounding the weight of n' in H = G U F. For every edge {u, v) of tt' we have w{u, v, H) < 
w{u,v,F) < w{u,v,Fj) ■ Pj if {u,v) G Fj and w{u,v,H) < w{u,v,G) < w{u,v,Gj) ■ pj 
otherwise. Thus, w{t^', H) < w{'k', Hj) ■ pj and together with the assumption h > n^/PA/(p + 
2) we get 


^(p+2)h/A^^^ U, H) < H) < w{H, Hj) ■ Pj < 


^(1 + d)d{u, V, Gj) + 


e'ni/PA'\ 

p + 2 ) 


■Pj 


/I /^ 7/ ^ e'n^^PA'pj 

= (1 + e )d{u, V, Gj) • Pj + — 

p z 

/N 7/ ^ 3e'2%VPA 

= (1 + e )d{u, V, Gj) ■ Pj + ^ 

< (1 + e)d{u, V, Gj) ■ Pj + 3€2^ 

< (1 + e)‘^d^{u, V, G) + 3ed^{u, v, G) 

< (1 + 6e)d^{u, V, G) 

= {1 + e)d^{u,v,G). □ 


3.4 Computing the Hop Set 

We hnally explain how to repeatedly use the hop reduction of Procedure 4 to obtain an 
o(l))-hop set. Procedure 4 computes a set of edges F that reduces the number of hops 
needed to approximate the distance between any pair of nodes by a factor of 1/A (where 
A is a parameter). Intuitively we would now like to use a large value of A to compute a 
hop set. However, we want to avoid large values of A for two reasons. The first reason is 
that F only reduces the number of hops if the shortest path has h > Are°(^^ hops. Thus, for 
shortest paths that already have h < A hops the hop reduction is not effective. The second 
reason is efficiency. The algorithm requires us to compute clusters for distances up to 
and, in the models of computation we consider later on, we do not know how to do this fast 
enough for our purposes. 
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We therefore use the following iterative approach in which we repeatedly apply the hop 
reduction of Procedure 4 with A = We first compute a set of edges Fi that reduces 

the number of hops in G by a factor of 1/A. We then add all these edges to G and consider 
the graph Hi = G U Fi. We apply the algorithm again on Hi to compute a set of edges F 2 
that reduces the number of hops in Hi by a factor of 1/A. Now observe that the set of edges 
Fi U F 2 reduces the number of hops in G by a factor of 1 /A^. We show that by repeating 

this process p = 0(y^logn/ log (\/log ^Z^)) times we can compute a set F that reduces the 
number of hops to Procedure 5 shows the pseudocode of this algorithm. 


Procedure 5: HopSet(G, e, W) 

Input: Weighted graph G = {V, E) with integer edge weights from 1 to VP, 0 < e < 1 
Output: (n^/^,e)-hop set F as specified in Theorem 3.9 


1 eW— 

2 W' f 

3 p <— 

4 A ^ 

5 F <— 

6 Ho ^ 


2y/\og n 

- (1 + e)nW 

I log n I 

V log(54A')J 
(p + 2)n^/P 


G 


7 for i = 0 to p — 1 do 


8 

9 

10 

11 


hi ^ 

Fi+i ^ HOPREDUCTION(i/i, A, hi, e', W') 
F ^ F U Fi+i 
Hi+i Fj U Fj_|_i 


12 return F 


Theorem 3.9. Let F C V'^ be the set of edges computed by Procedure 5 for a weighted 
graph G = (14,F) and a parameter 0 < e < 1. Then F is an {n^^P,e)-hop set of size 

0(p^n^+^/^lognVP), where p = (log n)/(log (108-v/log ra/^J. 

Proof. The algorithm sets e' = e/(2^1og n) and p = [ 1 / (logn)/(log (54/e'))J and uses a 
parameter hi = n^~^^P for each graph Hi. For every 0<i<p — 2we set hi = n^~'^/P > 
n^/P = n^/P/S.I{p + 2) and thus, by Lemma 3.8, for every pair of nodes u and v we have 

d!^^+^{u,v,H,+i) = d^^/^"'\u,v,Hi+i) = d^P+^'^'^^/^{u,v,H,+i) < {l + d)d}^^{u,v,Hi). 

By iterating this argument we get 

d!^^{u, V, Hi) < (1 + e'yd^^{u, v, Ho) = (1 + e'ydJ^{u, v, G) = (1 + e')*d(n, v, G) 
for every 1 < i < p — 1 and now in particular for i = p — 1 we have 

d^'^’’{u,v,Hp_i) =d'^p-yu,v,Hp_i) < {l + e')^P-^U{u,v,G). 
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Finally, since p — 1 < -yiogn we have, by Lemma 3.10 below. 




< 1 + e. 


As Hp-i = GUF it follows that {u, v, GUF) < {l+e)d{u, v, G) and thus F = Ui<i<p-i 
is an e)-hop set. □ 


Lemma 3.10. For allQ < x <1 and all y > 0, 


Proof. Let e denote Euler’s constant. We will use the following well-known inequalities: 
(1 -|- \lzY < e (for all z > 0), < 1/(1 — z) (for all z < 1), and 1/(1 — z) <\ F2z (for all 

0 < z < 1/2). We then get: 


1 + 




< 62 < 


1 

1^ 


< 1 -I- X . 


□ 


The main computational cost for constructing the hop set comes from computing the 
clusters in Procedure 3, which is used as a subroutine repeatedly. Observe that in total 
it will perform 0{plognW) calls to Procedure 2 to compute clusters, each with p = 

Q{\J (logn)/(log (Vlogn/^) priorities and distance range R = 0{p\Aognn‘^/Pje) on a 
weighted graph of size 0(m-|-log (nW)). Note that if 1/e < polylog n, then = 
Thus, Procedure 5 will then compute an o(l))-hop set of size 0{n^~^°^^YogW) 

and it will perform O(loglT) cluster computations with p = 0(•\/log n/ log log n) priorities 
up to distance range 0{n°^^Y graphs of size logW) each. 


4 Distributed Single-Source Shortest Paths Algorithm on Net¬ 
works with Arbitrary Topology 

In this section we describe a deterministic distributed algorithm for computing distances 
from a source node s. It consists of two parts. The first part is constructing a suitable 
overlay network. A randomized construction algorithm was given in [Nanl4] such that it was 
sufficient to solve SSSP on the resulting overlay network in order to solve the same problem 
on the whole network. We give a deterministic version of this result in Section 4.1. The 
second part is a more efficient algorithm for computing SSSP on an overlay network using 
Procedures 2 to 5 from before (see Section 4.2). In Section 4.3, we show how to hnish the 
computation following [Nanl4]. 

4.1 Computing an Overlay Network Deterministically 

An overlay network (also known as landmark or skeleton [Soml4, LPS13]) as defined in 
[Nanl4] is a virtual network G' of nodes and “virtual edges” that is built on top of an 
underlying real network G; i.e., V{G') C V{G) and E{G') = V{G') x V{G') such that the 
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Figure 2: An overview of the main steps of our algorithm. The left picture depicts the input 
graph. Thick edges and nodes (in black) in the middle picture depicts a possible overlay 
network. Dashed edges (in red) in the right picture depicts a possible hop set of the overlay 
network. 


weight of an edge in G’ is an approximation of the distance of its endpoints in G and is oo 
if no path exists between them in G. The nodes in V{G') are called centers. Computing 
G' means that after the computation every node in G' knows whether it is a center and 
knows all virtual edges to its neighbors in G' and the corresponding weights. We show in 
this subsection that there is a algorithm that constructs an overlay network G’ 

of 0{^/n/e) nodes such that a (1 + e)-approximation to SSSP in G\ can be converted to a 
(1 + 7e)-approximation to SSSP in G, as stated formally below. 

Theorem 4.1. Given any weighted undirected network G with source node s and any 
parameter 0 < e < 1, there is a 0{^/n)-time deterministic distributed algorithm that computes 
an overlay network G' and some additional information for every node with the following 
properties. 

• Property 1: \V{G')\ = 0(>/n/e) and s G V{G'). 

• Property 2: For every node u G V{G), as soon as u receives a (1 + e)-approximation 
d{s,v) of d{s,v,G') for all centers v G V{G'), it can infer a (1 + 7e)-approximation of 
d(s, u, G) without any additional communication. 

Before proving the above theorem, we first recall how it was achieved with a randomized 
algorithm in [Nanl4]^*^ (see Theorem 4.2 of the arXiv version^^ of [Nanl4] for details). 

• In the first step of [Nanl4], the algorithm selects each node to be a center with 
probability Q{l/y/n) and also makes s a center. By a standard “hitting set” argument 
(e.g. [UY91, DFI05]), any shortest path containing ^/n edges will contain a center with 
high probability. Also, the number of centers is ©(-^/n) with high probability. 

• In the second step, the algorithm makes sure that every node v knows the ©(\/n)-hop 
distances between v and all centers using a light-weight bounded-hop single-source 
shortest paths algorithm from all centers in parallel combined with the random delay 
technique to avoid congestion. 

note that [Nanl4] proved this theorem for general parameters A and a but we will only need it for 
A = a = ^/n. 

^^http://arxiv.org/pdf/1403.5171v2.pdf 
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We derandomize the first step as follows: In Section 4.1.1 we assign to each node u a type, 
denoted by t{u). (To compute these types, we invoke the source detection algorithm of 
Lenzen and Peleg [LP13], as we will explain in Section 4.1.1.) The special property of nodes’ 
types is that every path vr containing ^/n edges contains a node u of a “desired” type; in 
particular, t{u) is not too big compared to w{tt,G) (see Lemma 4.2 for details). This is 
comparable to the property obtained from the hitting set argument, which would be achieved 
if we made the special node of every path a center. However, this may create too many 
centers (we want the number of centers to be 0(y/n/e)). Instead we select some nodes to be 
centers using the ruling set algorithm, as described in Section 4.1.2. After this, we get a 
small set of centers such that every node u of type t{u) is not far from one of the centers. 
Thus, while we cannot guarantee that the path vr contains a center, we can guarantee that it 
contains a node that is not far from a center (see Lemma 4.3 for details). To derandomize 
the second step, we use the recent algorithm of Lenzen and Patt-Shamir [LP15] for the 
Partial Distance Estimation (PDE) problem together with the above Procedures 2 to 5, as 
we will explain in Section 4.1.3. 

Parameters. The parameters used by our algorithm in the following are e' = e/(18A), 
h = [e'-y/nj, h' = {1 + 2je')h < 3y^, h* = 9Xy/n, k = 2h* + 2 -y/n, and A:' = (1 + 2le')k. 
Recall that A is the number of bits used to represent each ID in the network. 

4.1.1 Types of Nodes 

For any integer i, we let pi = ^ and let Gi be the graph with the same nodes and edges 
as G and weight w{u,v,Gi) = (lAlLLfill for every edge {u,v). Note that we have chosen h' 
such that d{u, v, Gi) < h' by Equation (3) of Lemma 2.2. For any node u, let the ball of u 
in Gi be B{u, Gi, h') = {u e V{Gi) \ d{u, v, Gi) < h'}. Note that for any index i and nodes 
u and V, d{u,v,Gi-\.i) < d{u,v,Gi); thus, B{u,Gi,h') C il(tt, Gj+i,/i'). Let the type t{u) of 
u be the smallest index i such that \B{u,Gi,h')\ > h. We crucially exploit the following 
structural property. 

Lemma 4.2. For every path tt of G consisting of |7r| = ^/n edges there is a node u on n 
such that 2*(“) < 2e'w{'K,G). 

Proof Let I = [| 7 r|/h] > 1/e and let x and y denote the endpoints of vr. Partition vr into 
the path -Kx consisting of the {I — l)h edges closest to x and the path vr^ consisting of the 
| 7 r| — {I — l)h edges closest to y. Further partition tTu into I — 1 non-overlapping subpaths of 
exactly h edges, and expand the path iTy by adding edges of tTx to it until it has h edges. 
Thus, there are now I paths of exactly h edges each and total weight at most 2w{tt, G). It 
follows that there exists a subpath vr' of vr consisting of exactly h edges and weight at most 
2w{tt, G)/l < 2e'w{'K, G). Let u and v be the two endpoints of vr' and let i be the index such 
that 2* < dP'{u,v,G) < 2*'’'^. By Equation (3) of Lemma 2.2 it follows that d{u,v,Gi) < h', 
which implies that B{u,Gi,h') contains tt’. Hence \B{u,Gi,h')\ > h and t{u) < i. This 
shows that 2 *^“) < 2 * < d^{u,v,G) < w{tt',G) < 2e'w{'K,G). □ 

Computing Types of Nodes. To compute t{u) for all nodes u, it is sufficient for every 
node u to know, for each i, whether \B{u,Gi,h')\ > h. We do this by solving the {S,'y,cr)- 
detection problem on Gi with S = V{G), 7 = h' and a = h, i.e., we compute the list 
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L{u, S,'y,a,G) for all nodes u, which contains the a nodes from S that are closest to u, 
provided their distance is at most 7 . By Theorem 2.4 this requires 0(7 + d) = 0{h + h') = 
0{y/n) rounds. For any node n, \L{u^V{G)^h\h,G)\ = h if and only if \B{u,Gi,h')\ > h. 
Thus, after we solve the (S', 7 , iT)-detection problem on all Gj, using 0{-yn) rounds, every 
node u can compute its type t(u) without any additional computation. 


4.1.2 Selecting Centers via Ruling Sets 

Having computed the types of the nodes, we compute ruling sets for the nodes of each type 
to select a small subset of nodes of each type as centers. Remember the two properties of an 
(a, /i)-ruling set T of a base set U: (1) all nodes of T are at least distance a apart and (2) 
each node in ?7 \ T has at least one “ruling” node of T in distance (3. We use the algorithm 
of Theorem 2.7 to compute a {2h' + 1, {2h' + l)A)-ruhng set Ti for each graph Gi where the 
input set Ui consists of all nodes of type i. The number of rounds for this computation is 
0{h') = 0{^/n). We define the set of centers as V = (Ui^i) U {s}. Property (1) allows us 
to bound the number of centers and by property ( 2 ) the centers “almost” hit all paths with 
y/n edges. 

Lemma 4.3. (1) The number of centers is \V'\ = 0{y/n/e). (2) For any path vr containing 
exactly y/n edges, there is a node u in vr and a center v ^ V' such that d^ {u,v,G) = 
ew{'K,G)), where h* = 9\/nA. 

Proof (1) For each i, consider any two nodes u and v in Tj. Since d{u,v,Gi) > 2h' by 
Property (1) of the ruling set, B{u, Gi, h') n B{v, Gi, h') = 0. As every node u G Ti is of 
type i, \B{u, Gi, h')\ > h for every u G Ti. We can therefore uniquely assign h nodes to every 
node u G Ti and thus \Ti\ < n/h = 0{y/n/e). 

(2) By Lemma 4.2, there is a node u in vr such that 2*^“^ = 2e'w{'K, G). Moreover, there 
is a center v in the ruling set such that d{u, v, Gti^W)) < {2h' + 1)A < 3/i'A < h*. (The 
last inequality is because h' < 3^/n.) Let tt' be the shortest path between u and v in Gn^Wj. 
Then w{t^',G t(^u)) = d{u,v,Gt(^W)) < and as a consequence tt' contains at most h* edges. 
It follows that 


d^ {u,v,G) <w{tt',G) 

= H w{x,y,G) 

{x,y)&E(-K') 

< XI Pt{^) ■ (recall that w{x, y, Gt(u)) = ) 

(x,y)&E(-K') 

< Pt{u) ■ rc(7r',Gt(„)) 

Pt(u) ■ d{u, V, Gi(^W)) 

= ^ d{u,v,Gt(u)) 


= 9A2*(“) 

< 18Ae^u)(7r, G) 
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ew{'K,G)). 


□ 


4.1.3 Computing Distances to Centers 

Recall that we use parameters defined above Section 4.1.1. Let k = 2h* + 2^/n, where 
h* = 9^/nX (as in Lemma 4.3). In this step, we compute for every node u and every center 

V a value d{u, v) that is a (1 + e)-approximation of v, G) such that each node u knows 
d{u, v) for all centers v. In particular we also compute d{u, v) for all pairs of centers u and 
V. To do this we follow the idea of partial distance estimation [LP15]. As in Section 4.1.1, 
we do this by solving the source detection problem on a graph with rounded weights.For 
any integer i, let (pi = ^. Let Gi be the weighted graph such that w{u,v,Gi) = 

for every edge (u, v) in G. 

We solve the (5, 7 , cr)-detection problem on Gi for all 0 < i < [log nlTj, with parameters 
S = V, 7 = k', and a = \V'\ (where \V'\ = 0{^yn/€) by Lemma 4.3). Using the algorithm 
of Theorem 2.4 repeatedly this takes 0(7 + cr) = 0{y/n) rounds. At termination, every node 
u knows the distances up to k' to all centers in all Gp i.e., it knows d{u, v, k', Gi) for all i 
and all centers v. For every node u and center v we set d{u, v) = minj{(/jj • d{u, v, k', Gi)}. 
Every node u can compute d{u, v) without any additional communication as soon as the 
source detection algorithm is finished. Now, observe that for the index i* such that 2* < 
d^{u,v,G) < 2**"*“^, it follows from Equation (3) of Lemma 2.2 that d{u,v,Gi*) < k' which 
implies that d{u,v,k',Gi*) = d{u,v,Gi*). With Equations (2) and (4) it then follows that 

d{u, V, G) < d{u, v) < ipi* ■ d{u, v, k', Gi*) = pi* ■ d{u, v, Gi*) < (1 + e)d^{u, v, G) . 

Hence d{u,v) is the desired (1 + e)-approximation of d^{u,v,G). 

4.1.4 Completing the Proof of Theorem 4.1 

We define our final overlay network to be the graph G' where the weight between any two 
centers u,v G V{G') is d{u,v) (as computed in Section 4.1.3). Additionally, for every node 
u G V{G) we store the value of d{u,v) to all centers v G V{G'). We now show that all 
properties stated in Theorem 4.1 hold for Gk Since we need 0(*/n) rounds in Sections 4.1.1 
and 4.1.3 and 0(^/n) rounds in Section 4.1.2, the running time to construct G' is 0{*/n). 
Moreover, |U(G^)| = 0{y/n) as shown in Lemma 4.3. This is as claimed in the first part of 
Theorem 4.1. It is thus left to prove the following statement in Theorem 4.1: “for every node 
u G U(G), as soon as u receives a (1 + e)-approximation d{s,v) of d{s,v,G') for all centers 

V G U(G'), it can infer a (1 + 7e)-approximate value of d{u,s,G) without any additional 
communication.” 

Consider any node u and let tt be the shortest path between s and u in G. If tt contains 
less than */n edges, then d^{u, s, G) = d{u, s, G) and thus the value d{u, s) known by u is 
already a (1 + e)-approximate value of d{u,s,G). If vr contains at least */n edges, then 
partition tt into subpaths ttq, tti, ..., (for some t) where ttq contains s, tt* contains u, ttq 
contains at most edges, and every subpath except vro contains exactly edges. By 

note that the algorithm and analysis described in this subsection is essentially the same as the proof of 
[LP15, Theorem 3.3]. We cannot use the result in [LP15] directly since we need a slightly stronger guarantee, 
which can already be achieved by the same proof. (We thank Christoph Lenzen for a communication regarding 
this.) 
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Lemma 4.3, for every 1 < i < t, there is a node Ui and a center Vi such that (i) Ui is in vTj and 
(ii) {ui,Vi,G) = ew{'Ki,G)). Additionally since Ui and Ui+\ lie on tt, their shortest path is 
the subpath of tt between them and, thus, it consists of at most 2^/n edges. It follows that 
d?‘^{ui,UiJ^i,G) = Uj+i, G). By our choice of fc = 2h* + 2y/n the triangle inequality 
gives 

d’^{vi,Vi+i,G) < d^\vi,Ui,G) + d?'^{ui,Ui+i,G) + d!^*{ui+i,Vi+i,G) 

< d{ui,Ui+i,G) + ew{'Ki,G) +w{'Ki+i,G). 

By the same argument, d^{s, ui, G) < d{s, ui,G) + ew{Tri, G) and d^{u, vt, G) < d{u, ut, G) + 
ew{Trt,G). As every edge {x,y) in G' has weight d{x,y), we get 

d{s, Vt) + d{u, Vt) < (1 + e)d{s, vt, G') + d{u, vt) 


< (1 + e) ^d{s,vi,G') +^d(ui,Ui+i,G')^ +d{u,vt) 

< (1 + e) ^d{s, ui) + ^ d{vi,Vi+i)^ + d{u, vt) 

< (1 + e)^ vi,G) + ^ d^{vi,Vi^i, G)^ + (1 + e)d^{u, vt) 

< (1 + e)^ ui, G) + ^ d^{vi,Vi^i, G) + d^{u, vt, G)^ 

< (1 + e)^ ^d{s,u,G) + e^u;(7ri,G)^ 


= (1 + e)^ {d{s, u, G) + ed{s, u, G)) 

< (1 + e)^d{s, u, G) 

< (1 + 7e)d{s, u, G). 

Thus, when u receives d{s,v') for all centers v e V{G'), it can compute minj,gv'(c'/)((i(s, u) + 
d{u,v)), which, as we just argued, is a (1 + 7e)-approximation of d{s,u,G). 

4.2 Computing a Hop Set on an Overlay Network 

We now show how to simulate the hop set algorithm presented in Section 3 on an overlay 
network G' and how to compute approximate shortest paths from s in G' using the hop 
set. Throughout the algorithm we will work on overlay networks whose node set is the set 
of centers V{G'), but which might have different edge weights as, e.g.. Procedure 4 calls 
Procedure 3 on overlay networks with modihed edge weights. Thus, we will use G" to refer 
to an overlay network on which Procedures 2 to 5 run to emphasize the fact that they might 
not equal G'. We let N be the number of centers in G' and G". Thus N = 0(n^/^). 

4.2.1 Computing Bounded-Distance Single-Source Shortest Paths 

We will repeatedly use an algorithm for computing a shortest-path tree up to distance R 
rooted at s on an overlay network G'\ where R = 0(re°^^^). At the end of the algorithm every 
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node knows this tree. We do this in a breadth-first search manner, in i? -|- 1 iterations. Like 
in Dijkstra’s algorithm, every node keeps a tentative distance 5{s, u) from s and a tentative 
parent in the shortest-path tree, i.e., a node v such that 6{s,u) = 5(s,u) + w{u,v,G"). 
Initially, (5(s, s) = 0 and 6{s, v) = oo for every node v ^ s. In the iteration, for L from 
0 up to R, all nodes in G" whose tentative distance (5(s, u) is exactly L broadcast^^ to all 
other nodes a message (u, (I(s, u),v) where v is the parent of u. Using this information, every 
node u will update (“relax”) its tentative distance 5{s,u) and its tentative parent. 

Clearly, after the iteration, centers that have distance L + 1 from s (i.e., that are at 
level L in the shortest-path tree) will already know their correct distance. Thus, at the end 
of the last iteration every center knows the shortest-path tree rooted at s up to distance 
R in G”. To analyze the running time, note that over R rounds we broadcast N messages 
in total, and if messages are broadcast in the iteration, then this iteration takes 
0{mi + D) rounds. (We emphasize that the number of rounds depends on the diameter D 
of the original network, and not of G".) The total number of communication rounds used 
over all iterations is thus 0{RD + = 0{RD -|- N). 

4.2.2 Computing Priorities 

We will simulate Procedure 1. All necessary parameters can be computed beforehand and 
thus do not require any communication. Initially every center knows that it is contained 
in Aq = V. To compute Aj+i given that Ai is known (i.e., every center knows whether 
it is in Ai or not), we compute the list L{v, Ai, R, q,G") for every center v using a source 
detection algorithm and distribute each list to every center. Then every center runs the same 
deterministic greedy hitting set approximation algorithm to compute Aj+i. We will obtain 
(Ai)o<i<p by repeating this for p iterations. Thus, we only have to solve the (5, 7 , cT)-source 
detection problem with S = Ai, 'j = R, and a = q on an overlay network G”. For this 
purpose we simulate the source detection algorithm of Roditty, Thorup, and Zwick [RTZ05] 
(see Theorem 2.5). 

Recall that in the first phase of this algorithm, we add an artificial source s* to G with 
a 0 -length edge between s* and every center in S and perform a single-source shortest 
paths computation from s* up to distance 7 . We do this in the same way as the bounded- 
distance shortest-path tree algorithm described in Section 4.2.1, except that we initially set 
5(s*, u) = 0 for every center u G S. The subsequent iterations in this phase are exactly as 
described in Section 4.2.1. Note that due to the broadcast messages every center knows 
that shortest-path tree from s* in G" at the end of the first phase. In the next phase, we 
have to compute a shortest-path tree on the graph G”. We observe the following from the 
construction of G” described in Section 3.1. 

Observation 4.4. Every center u in G" will know all edges incident to it in G” if it knows, 
for every center v in G", (1) Ui{v) and (2) the distance (in G") from v to all centers in 

'^^More precisely, there is a designated node (e.g. the node with lowest ID) that aggregates and distributes 
the messages (via upcasting and downcasting on the breadth-first search tree of the network G), and tells 
other nodes when the iteration starts and ends. 

Although the algorithm of Lenzen and Peleg [LP13] (see Theorem 2.4) was designed for the distributed 
setting, we do not know how to simulate it on the overlay network with the running time guarantees we 
require. In particular we have no guarantee that in 0{D) rounds each node of the overlay network can send 
a message to all its neighbors, which is allowed in one round of the original algorithm. The reason is the 
congestion we obtain by individual nodes being part of many paths used for hop set edges. 
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Ui{v). In particular, it will know this information if it knows the shortest-path tree obtained 
in the previous phase. 

Since the shortest-path tree obtained in the previous phase is known to every center, we 
can assume that every center u in G" knows all its incident edges in Gf at the beginning of 
the phase. Now we compute the shortest-path tree up to distance i? in a similar manner as 
in Section 4.2.1. The only exceptions are that (1) initially every center u in G" knows that 
V has distance 0 to s*, for every center v in Gf, and they use this information to initialize 
the value of 6{s*,u) and (2) that they use the adjacent edges in Gf and not in G" to update 
their distance to s*. 

The subsequent phases are simulated in the same manner; i.e., using the shortest-path 
tree computed in the previous phases, every center knows its incident edges in G'-. This 
allows us to compute the shortest-path tree in G'- in the breadth-first search manner. After 
we finish all a phases, every center in G" will know L{v,S,^,a,G") for all v. Using the 
aforementioned analysis of the bounded-distance shortest-path tree, it follows that every 
phase takes 0{'yD + N) rounds and as there are a rounds the whole source detection 
algorithm takes 0{a^D + ctN) rounds For computing the priorities we repeat this for all p 
priorities and thus get the following. 

Lemma 4.5. The above algorithm deterministically computes a hierarchy (Aj)o<i<p 
in Lemma 3.2 in 0{pq{RD + N)) rounds. With our choice of parameters (N = 
p < logn, q = and R = the running time of the above algorithm is 

0(ni/2+°(i) -hT)n°(i)). 

4.2.3 Computing Clusters 

We now describe how to compute clusters in G”. At the end of the cluster computation every 
center will know the cluster G{u) of every center u. We do this by simulating Procedure 2. 
First, we need to compute d{v, Aj+i, R, G), for every I < i < p. This can be done in exactly 
the same way as in the first phase of computing the hierarchy i.e., we simply add a 

virtual source s* and edges of weight zero between s* and centers in Ai, and compute the 
shortest-path tree up to distance R rooted at s*. Since such a tree can be computed in 
0{RD + N) rounds and we have to compute p < logn such trees, the total time we need 
here is 0{RD + N). 

Next, we compute a shortest-path tree up to distance R from every center u in G", as 
described in Procedure 2. That is, at Iteration L (starting with L = 0 and ending with 
L = R), every center v having (a) 5{u,v) (the tentative distance from u to v) equal to 
L and (b) additionally 6(u,v) < d{v, Ai^i, R,G) will broadcast^^ its distance to u to all 
other centers so that every other center, say w, can (1) update its tentative distance 6{u,w) 
and (2) add v and 6{u, v) to its locally stored copy of G{u). It thus follows that for every 
center u the number of messages broadcast during the computation of C{u) is |G(u)|. Thus, 
there are J2u£V{G") |C'('u)| messages broadcast in total, which is bounded from above by 
0{pN^/^) = 0(re^/2+o(i)j clue to Theorem 3.3. 

Note that this procedure computes G{v), for all centers v, in parallel. Each Iteration L 
requires 0{D + J2v£V{G") '^v,l) rounds, where niy^L is the number of messages needed to be 

note again that to do this, there is a designated center that aggregates and distributes the messages 
(via upcasting and downcasting), and tells other centers when the iteration starts and ends. 
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broadcast at Iteration L in order to compute C{v). The total number of rounds over all R 
iterations is thus 

0{RD + E E m^^L) = 0{RD+ Y. \C{v)\) = 0{n°^^'>D + n^/^+°^^^). (10) 

0<L<Rv^V{G") v^V{G") 

Note that since the computation is done by broadcasting messages, every center knows 
cluster C{v) for all v at the end of this computation. 

Lemma 4.6. For any overlay network G" with centers, the above algorithm deter¬ 

ministically computes clusters up to distance R = (where every center knows C{u) for 
all centers u) that satisfies Theorem 3.3 in 0{n°^^^D + rounds. 

4.2.4 Computing the Hop Reduction with Additive Error 

We will simulate Procedure 3. All necessary parameters can be computed beforehand and 
thus do not require any communication. We can then execute Clusters(G, p, R) using the 
above algorithm to get {C{v), S{v, ■))vev- With this information the set F can be computed 
without any additional communication. Thus, executing Clusters(G, p, R) is the only 
part for computing F that requires communication. By Lemma 4.6 the total time needed to 
execute Procedure 3 is therefore 

0(n°(^)L> + n^/2+o(i)^ _ 

4.2.5 Computing the Hop Reduction without Additive Error 

We will simulate Procedure 4. All necessary parameters can be computed beforehand and thus 
do not require any communication. Moreover, every center knows about the edges incident 
to it and can thus compute by scaling down edge weights without any communication. 
We then execute Procedure 3 to compute F(. Knowing Fr we can compute F without any 
additional communication. Thus, executing Procedure 3 is the only part for computing F 
that requires communication and it is executed O(lognlT) times. As simulating Procedure 3 
takes time D (D) 

, as argued above (cf. Equation (11)), the total time needed 

to execute Procedure 4 is 

log (nW) + n^/2+°(i) bg (nW)) . (12) 

4.2.6 Computing the Hop Set 

We will simulate Procedure 5. All necessary parameters can be computed beforehand. 
Computing Fj+i is done by simulating Procedure 4 on the graph Hi, which, as argued 
above (cf. Equation (12)), takes time D log {nW) -\-log {nW)). As every 

center knows its incident edges, the graph Rj+i can be computed from Ej+i without any 
additional communication. As we execute Procedure 4 p < logn times, the total time needed 
to simulate Procedure 5 is 

0{pn°^^^Dlog (nW) -F log (nW)) = 0(n"*'^^T>log (nW) + log (nW)). 

By running this algorithm on G' (which, as pointed out, involves performing hop reductions 
and computing clusters on some other overlay networks), we obtain the following theorem. 
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Theorem 4.7. There is a deterministic algorithm that computes a hop set of G' in 
D log {nW) + log (nPT)) rounds. In the end of the process, every center 

knows every edge in the hop set. 

4.2.7 Routing via the Hop Set 

Remember that the overlay network is computed using the source detection algorithm of 
Lenzen and Peleg [LP13]. If a node x of the overlay network wants to send a message to one 
of its neighbors y in the overlay network it can do so by routing the message along a path in 
the original network whose length is upper-bounded by the weight of the overlay edge {x,y). 
This routing can be obtained by modifying the source detection algorithm to additionally 
construct BPS trees rooted at the sources (see [LP13]), which in our case are the nodes of 
the overlay network. 

When we compute the hop set on the overlay network we broadcast all computed clusters 
to all nodes in the network. In this way the clusters, the corresponding partial shortest 
path trees of the clusters, as well as the hop set edges become global knowledge. Therefore 
every node in the overlay network learns for every hop set edge (x, y) to which path from 
X to y in the overlay network it corresponds. Thus, also for every hop set edge (x, y) of 
the overlay network, x can send a message to y by routing the message along a path in the 
original network whose length is upper-bounded by the weight of the overlay edge (x, y). This 
means that the hop set computed by our algorithm has the following awareness property, as 
introduced in the full version of [ENlGb]: A hop set F for a graph G is called aware if for 
every hop-set edge (x, y) G T of weight b there exists a corresponding path vr in G between 
X and y of length b. Furthermore, every node x on vr knows dTr{v,x) and dT^(v,y), and its 
neighbors on vr. 

4.3 Final Steps 

Let H be the graph obtained by adding to G' the edges of the o(l))-hop set computed 

above. To (1 + o(l))-approximate d{s,v,G') for every center v in G', it is sufficient to 
(1 -|- o(l))-approximate the /i-hop distance d^{s,v, H), for some h = The latter task 

can be done in 0{hD + |F(G')|) = -|- ^i/ 2 -i-o(i)^ rounds by the same method as 

in Lemma 4.6 in the full version of [Nanl4]. We give a sketch here for completeness. Let 
e = 1/logn. For any 1 < z < log(nVF), let iL' be the graph obtained by rounding edge 
weights in H' as in Section 4.1.1; i.e., for every edge {u,v) we set w{u,v,H'f) = ) ~| ^ 

where pi = ^. For each Hi, we compute the shortest-path tree rooted at s up to distance 
R = 0{h/e), which can be done in 0{RD + D + rounds, using 

the procedure described in Section 4.2. This gives d{s,v, R, Hi) for every center v. We then 
use the following value as (1 -f o(l))-approximation of d^{s, v, H) (and thus of d{s, v, G'))\ 
d{s, v) = miuj pid{s, v, R, H'f). The correctness of this algorithm follows from Lemma 2.2. 

Once we have (1 -|- o(l))-approximate values of d{s,v,G') for every center v G F(G'), 
we can broadcast these values to the whole network in 0{D + rounds. Theorem 4.1 
then implies that we have a (1 -f o(l))-approximate solution to the single-source shortest 
paths problem on the original network. The total time spent is 0{n°^^^D + ni/^'i“°(i)). By 
observing that the term will show up in the running time only when D = cx(n°*^^^), 

we can write the running time as as claimed in the beginning. 
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We thus have obtained the following result. 

Theorem 4.8. There is a deterministic distributed algorithm that, on any weighted undirected 
network, computes (1 + o{l))-approximate shortest paths between a given source node s and 
every other node in rounds. 

5 Algorithms on Other Settings 

5.1 Congested Clique 

In the congested clique model, the underlying communication network is a complete graph. 
Thus, in each round every node can send a message to every other node. Apart from this 
topological constraint, the congested clique is similar to the CONGEST model. 

We compute an o(l))-hop set on a congested clique by simulating the hop set 

construction algorithm in the same way as on an overlay network as presented in Section 4.2. 
(However, we do not compute an overlay network here.) The only difference is the number 
of rounds needed for nodes to broadcast messages to all other nodes. Consider the situation 
that m' messages are to be broadcast by some nodes. On a network of arbitrary topology, 
we will need 0{D + m!) rounds. On a congested clique, however, we only need 0{m'/n) 
rounds using the routing scheme of Dolev et al. [DLP12, Lemma 1] (also see [LenlS]): If each 
node is source and destination of up to n messages of size O(logn) (initially only the sources 
know destinations and contents of their messages), we will need 0 ( 1 ) rounds to route the 
messages to their destinations. In particular, we can broadcast n messages in 0(1) rounds, 
and thus m' messages in 0{m! jn) rounds. Using this fact, the number of rounds needed for 
the algorithm in Section 4.2 reduces from 0{RD -\- J2veV(G'') Equation (10)) to 

0{R + J2v&V{G") \C(v)\/n) = 0{R + pn^^P) = on a congested clique.^® 

Once we have an (n°^^\ o(l))-hop set, we proceed as in Section 4.3. Let H be the 
graph obtained by adding to the input graph G the edges of the o(l))-hop set. We 

can treat H as a, congested clique network with different edge weights from G. {H can 
be computed without any communication since every node already knows the hop set.) 
To (1 + o(l))-approximate d{s,v,G) for every node v in G, it is sufficient to compute the 
/i-hop distance d^{s,v, H), for some h = To do this, we follow the same approach 

for this problem as in [Nanl4, Section 5.1], where we execute the distributed version of the 
Bellman-Ford algorithm for h rounds. That is, every node u maintains a tentative distance 
from the source s, denoted by (I(s, u) and in each round every node u broadcasts 5{s, u) to 
all other nodes. It can be shown that after k rounds every node v knows the fe-hop distance 
(i.e., 6{s,u) = d^{s,v, H)) correctly, and thus after h rounds we will get the h-hop distances 
as desired. 

'^®Instead of relying on the result of Dolev et al., we can use the following algorithm to broadcast m! 
messages in 0(rn! jn) rounds as follows. We assign an order to the messages, where messages sent by a node 
with smaller ID appears first in the order and messages sent by the same node appear in any order (a node 
can learn the order of its messages after it knows how many messages other nodes have). We then broadcast 
the hrst n messages according to this order, say Mi,..., M„, where message Mi is sent to a node with the 
i**' smallest ID, and such a node sends M; to all other nodes. This takes only two rounds. The next messages 
are handled similarly. This algorithm broadcasts each n messages using 2 rounds, and thus the total number 
of rounds is 0(m! jn). 

^^Note that instead of the Bellman-Ford algorithm, one can also follow the steps in Section 4.3 instead. 
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Theorem 5.1. There is a deterministic distributed algorithm that, on any weighted congested 
clique, computes (1 + o{l))-approximate shortest paths between a given source node s and 
every other node in rounds. 

5.2 Streaming Algorithm 

In the graph streaming model, the edges of the input graph are presented to the algorithm 
in an arbitrary order. The goal is to desing algorithms that process this “stream” of edges 
using as little space as possible. In the multipass streaming model we are allowed to read 
the stream several times and want to keep both the number of passes and the amount of 
space used as small as possible. 

Our streaming algorithm for constructing an o(l))-hop set proceeds in almost the 

same way as in Section 4.2. First observe that a shortest-path tree up to distance R can be 
computed in 0{R) passes and with 0(n) space: We use the space to remember the tentative 
distances of the nodes to s, and the shortest-path tree computed so far. At the end of the 
pass we add nodes having distances exactly L to the shortest-path tree, and update the 
distance of their neighbors in the {L 1)**^ pass. 

We compute the priorities, as described in Section 4.2.2, the algorithm proceeds in 
q = phases where in the phase we compute a shortest-path tree rooted at s* up 
to distance R = on graph Gi-i (where Go = G). As in 4.4, we can construct Gi if 
we know the shortest-path tree rooted at s* up to distance R. Thus, after computing such 
a tree, the algorithm can construct Gi and proceed to the next phase without reading the 
stream. The algorithm therefore needs 0{qR) = passes and 0{qn) = space. 

To compute clusters, we compute n shortest-path trees up to distance R rooted at 
different nodes in parallel. The number of passes is clearly 0{R). The space is bounded 
by the sum of the sizes of the shortest-path trees. This is 0(X]j;ey(G") |C'('y)|) which, by 
Theorem 3.3, is 0{pn^'^^/P) = To compute the hop set we only have to compute 

clusters O(logVF) times. So, we need log IF) passes and log VF) space in 

total. By considering the edges of the hop set in addition the edges read from the stream it 
suffices to compute approximate single-source shortest paths up to hops. Using the 
rounding technique this can be done in 0{n°^^'^) additional passes. 

Theorem 5.2. There is a deterministic streaming algorithm algorithm that, given any 
weighted directed graph, computes (1 + o{l))-approximate shortest paths between a given 
source node s and every other node in log IF) passes with log VF) space. 

6 Conclusion and Open Problems 

We present deterministic distributed (1 + o(l))-approximation algorithms for solving the 
single-source shortest paths problem on distributed weighted networks and other settings. 
The efficiencies of our algorithms match the known lower bounds up to an n°^^^ factor. 
Important tools consist of a construction the hop set and a deterministic process that 
replaces the well-known (randomized) hitting set argument. Our understanding of these tools 

This gives a (1 -|- o(l))-approximate value for v, H) for every node v, which is sufficient for computing a 
(1 -I- o(l))-approximate value for d(s, v, G). This algorithm is however more complicated. 
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is still limited. We leave as a major open problem whether we can improve our understanding 
of these tools in the following aspects. 

First, the main reason that the factor shows up in the bounds of our algorithms 
is because we rely on a o(l))-hop set of size To use the same approach 

to get similar bounds but without the term, we need efficient algorithms to construct 
a (polylog(n), o(l))-hop set of size 0{n). However, such hop set is not known to exist. We 
leaving the existence of such hop set (ignoring the efficiency in constructing it) as the first 
open question.^® 

Secondly, our deterministic replacement of the hitting set argument works only when the 
input graph is undirected. Our second open problem is thus how to derandomize algorithms 
on directed graphs (where edge directions do not affect the communication; see [Nanl4, 
Nanl6] for more details). In particular, it is known that single-source shortest paths can 
be (1 -|- e)-approximated on directed weighted graphs in + D) time [Nanl4], 

and single-source reachability can be computed in + D) time [GU15]. However, 

these results are obtained by randomized algorithms, and whether there are sublinear-time 
deterministic algorithms for these problems is still open. 

Finally, while our paper essentially settles the running time for computing single-source 
shortest paths approximately, the best running time for solving this problem exactly is 
still linear (by the Bellman-Ford algorithm). Whether there is a sublinear algorithm is a 
major open problem. In fact, in the past few years we have much better understood how to 
approximately solve basic graph problems, such as minimum cut, single-source shortest paths, 
all-pairs shortest paths, and maximum flows, on distributed networks (e.g. [NS14, GK13, 
GKK+15]). However, when it comes to solving these problems exactly^ almost nothing is 
known. Understanding the complexity of exact algorithms is an important open problems. 

We refer to [Nanl4, Nanl6] for further open problems. 
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A Proof of Lemma 2.2 


To prove Inequality (2), let vr* be a shortest path between u and v in Gi. Observe that if we 
consider this path in G (with the corresponding edge weights), then its total weight is at 
most the distance between u and v in G, i.e. w{'Ki^G) > d{u,v,G), because no path in G 
can have weight less than the shortest path in G. We therefore get 


Pi ■ d{u,v,Gi) = Pi ■ w{x,y,Gi) 
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Y. pi • 

{x,y)&iTi 
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> w{x,y,G) = w{iri,G) > d{u,v,G). 
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To prove Inequality (3), let tt be a shortest h-hop path in G. Observe that w{tt, Gi) > 
d{u, V, Gi), as again no path has smaller weight then the shortest path in Gi. By additionally 
exploiting the assumption d^{u,v,G) > 2 *, we get 


d{u, V, Gi) ■ Pi < w{tt, Gi) ■ Pi 
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= w{'k,G) + |7r| • Pi 
= d^{u,v,G) + |7r| • Pi 

< d^{u, v,G) + h - Pi 
= d^{u, V, G) + e2^' 

< d^{u, V, G) + ed^{u, v, G) 
= (1 + e)d^{u,v,G). 


To prove Inequality (3), we combine Inequality (4) with the assumption d^{u,v,G) < 2®+^: 

G,) < + ^ + ^ ^ (J ^ 2/0/>.. 
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B Proof of Lemma 3.1 


We prove the claim by the probabilistic method. Consider a sampling process that determines 
a set T C [/ by adding each element of C/ to T independently with probability l/(2x). Let 
Eq denote the event that |r| > \U\/x and for every 1 < i < k let Ei denote the event 
that T n S'i = 0. First, observe that the size of T is \U\/{2x) in expectation. By Markov’s 
inequality, we can bound the probability that the size of T is at most twice the expectation 
by at least 1/2 and thus Pr[Fio] = Pr[|P| > < 1/2- Furthermore, for every I < i < k, 

the probability that Si contains no node of T is 

\ |Si| / \ 2xln3A: 

2x) V 2x) — glnSfc 3 /j • 
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The set T fails to have the desired properties of a small hitting set if at least one of the 
events Ei occurs. By the union bound we have 


Pr[ U < E < 

0<j<fc 0<i<k 


1 

2 


+ k- 


1 


1 1 


It follows that the sampling process constructed a hitting set T for C = {5i,..., 5^} of size 
at most |r| < |?7|/a: with non-zero probability. Therefore a set T with these properties must 
really exist. This finishes the proof of Lemma 3.1. 


C Ruling Set Algorithm 

For each node u, we represent its ID by a binary number viV 2 ■ ■ - vx. Initially, we set To = U. 
The algorithm proceeds for b iterations. 

In the iteration, we consider Uj for every node u G Tj-i. If Uj = 0, u remains in Tj 
and sends a “beep” message to every node within distance c — 1. This takes c — 1 rounds as 
beep messages from different nodes can be combined. If Uj = 1, it remains in Tj if there 
is no node v G Tj_i such that d{u, v,G) < c and Vj = 0; in other words, it remains in Tj if 
it does not hear any beep after c — 1 rounds. The output is T = Tx- The running time of 
the above algorithm is clearly 0(cA) = 0(c log n). Also, the distance between every pair of 
nodes in T is at least c since for every pair of nodes u and v of distance less than c, there is 
a j such that Uj ^ Vj, and in the iteration if both u and v are in Tj_i, then one of them 
will send a beep and the other one will not be in Tj. Finally, it can be shown by induction 
that after the round every node in U is at distance at most jc from some node in Tj\ 
thus it follows that every node in U is at distance at most cA from some node in T. 
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